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UNIFORM  IMPROVEMENTS  ON  THE  CERTAINTY  EQUIVALENT 
RULE  IN  A  STATISTICAL  CONTROL  PROBLEM1 

L.  Mark  Berliner 
Department  of  Statistics 
Ohio  State  University 
Columbus ,  Ohio,  U.S.A. 

1.  The  Control  Problem.  Tne  problem  considered  here  arises  when  it 
is  desired  to  choose  values  for  design  variables  in  a  linear  model  so 
that  the  resulting  dependent  random  variable  will  be  close  to  a  prescribed 
constant.  For  discussions  and  applications  of  this  problem,  see  Aoki  [1], 
Dunsmore  [6]),  Lindley  [7],  and  Zellner  [12], 

The  analysis  here  concerns  the  following  transformed  version  of  the 
problem  as  given  in  Basu  [2]  and  Zaman  [11].  Consider  the  linear  model 

y  =  ztB  +  e, 

2 

where  B  is  an  unknown  p-vector,  z  is  an  arbitrary  p-vector,  and  s  'v  N(0,a  ) ; 
2 

a  ,  possibly  unknown.  Furthermore,  an  estimate  (independent  of  e)  of  B, 
say  8,  is  available.  Assume  that  8  'V  N^(B,A),  where  A  is  a  known,  positive 

A 

definite  matrix.  The  goal  is  to  choose  a  controller  z^( B)  which  performs 
well  with  respect  to  a  control  risk  function  Rc  given  by 

ci.i)  Rc(zc’6)  =  E<y-y*>2 

=  E([zc(B)]tB  +  e  -  y*)2, 

where  y*  is  a  non-zero  constant,  and  the  expectation  is  taken  over  both 
S  and  £ .  Computation  in  (1.1)  yields 

(1.2)  Rc(zc’s)  *  °2  +  <y*>2EC<y*)~1CzcCB)3tS-a32 - 


1Research  supported  in  part  by  the  National  Science  Foundation  under 
Grant  MSC-7802300  and  by  the  Office  of  Naval  Research  under 
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Now,  consider  the  following  transformation: 

X 
0 


/r1/2a 


A-1/23 


6  =  (y*)‘1A_1/2zcCB). 

Application  of  this  transformation  in  (1.2)  yields  the  equivalent  problem 
of  choosing  a  decision  rule  6(x)  =  (6^(x) , . . .  ^(x))^,  based  on  an  observation 
X  'v  N  (0,1),  0  unknown,  subject  to  incurring  a  loss, 

(1.3)  L(6 ,0)  =  (e^-l)2. 

This  version  of  the  control  problem  will  be  considered  below. 

The  approach  here  is  decision  theoretic;  control  rules  are  evaluated 
in  terms  of  their  risk  (expected  loss)  functions,  R(6,6)  =  E.L(S(X) ,0) . 

U 

Our  interest  is  the  proposition  of  rules  which  dominate,  in  risk,  the  rule 


A  (x)  =  |x| 
m  1  1 


i  1 2  P  2 

( ]  x  I  =  I  ^x^ . )  This  rule  is  the  certainty  equi  valent 


controller.  For  brevity,  discussion  of  the  "certainty  equivalent  principle" 

(also  known  as  the  "separation  principle")  is  omitted.  The  reader  is 

referred  to  Aoki  [1],  Basu  [2],  and  Zellner  [12]  for  such  discussions. 

However,  in  the  spirit  of  this  principle,  note  that  the  loss  function 

implies  that  6  should  be  a  estimator,  in  some  sense,  of  the  quantity 
.  —2 

|  ©  i  6.  Clearly,  6^  is  the  maximum  likelihood  estimator  of  this  quantity. 

As  will  be  seen,  a  theorem  of  Zaman  [11]  implies  that  6  is  inadmissible 

m 

in  all  dimensions.  (In  fact,  note  that  <$m  has  infinite  risk  when  p  is 


1  or  2 . ) 


2_.  Previous  Results.  Most  previous  results  are  concerned  with 
spherically  symmetric  (s.s.)  rules,  i.e.,  rules  of  the  form  S(x)  =  'i'(|x|)x. 
Many  inadmissibility  results  are  based  on  the  asymptotic  behavior  of  s.s. 

y,  |1  £,(7  .ic  I  vl  -►  cn  Tn  rva'rYt' 1 iT  is  Y.  cci  'Y.TTOCCYY  crmyi  fv.nc+-anY  /"i 
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(2.1) 


l 


(2.1)  6(x)  %  (jx!2*-c)-1x 

lor  |x|  sufficiently  large.  The  essence  of  the  available  results  is  that 

if  c  >  j  -  p,  then  6  is  inadmissible.  (See  Zaman  [10]  and  Berger,  Berliner, 

and  Zaman  [4]  for  precise  theorems.)  Related  wcrk  of  Zaman  [10]  and 

Takeuchi  [9]  also  concerns  rules  of  the  form  (2.1).  In  particular, 

Zaman  [10]  showed  that  the  value  c  =  5  -  p  is  asymptotically  optimal 

2  —1 

(i.e.,  for  all  1 0 1  sufficiently  large,  6(x)  %  (|x|  ->-S-p)  Xx  has  the 

smallest  risk  of  all  rules  of  the  form  (2.1)).  Berger,  Berliner,  and 

Zaman  [5]  studied  the  admissibility  of  s.s.  generalized  Bayes 

controllers.  Of  special  interest  in  this  paper,  it  was  shown  that,  under 

suitable  regularity  conditions  on  generalized  prior  measures,  the  corresponding 

2  “1 

generalized  Bayes  rules,  that  are  given  by  <5(x)  %  (|xp+S-p)  x  for  |x| 
large,  are  admissible. 

Three  other  results  are  directly  applied  in  the  discussion  below. 

These  results  are  paraphrased  here.  The  reader  is  referred  to  the 
indicated  references  for  precise  statements. 

RESULT  1.  (Zaman  [11]).  If  <5(x)  =  ¥(  |x|  )x  is  admissible ,  then 

i)  0  <  nr)  <  1. 

ii)  lim  nr)  =  1. 
r+0 

,(“) 


iii)  'i'(r)  is  a  C 


function. 


Note  that  Result  1;  i),  ii),  imply  that  6  is  inadmissible. 


m 


In  Result  2,  and  the  rest  of  the  paper,  the  following  notation  is  used: 


DEFINITION:  For  any  function  f,  let 

T^YCr))  =  min  {l;4'(r)>. 

Result  1  can  be  viewed  as  a  partial  motivation  of  Result  2. 


a&riR*-" 


RESULT  2.  (Zaman  [11].)  Let  60(x)  =  ¥0([x|)x.  If  ^(f^  i  X| )  >  1)  >  0, 
then  <5  given  by  <5(x)  =  T^(¥q(  |x|  ))x  dominates  6^. 

The  next  domination  result  was  given  by  Berliner  [5].  Thr  derivation 
of  this  result  is  based  on  an  integration  by  parts  technique  for  risk 
analysis  first  introduced  by  Stein  [8].  See  Berliner  [5]  for  ?  discussion 
and  recent  references. 

For  any  differentiable  function  ip(r)  let 

*•( r)  =  . 

or 

Also,  for  all  real  n  define  the  quantities  £(V)  and  £(V)  by 

QO 

£(¥)  -  /|2pCr^?+1^2(r)exp(-  ^r2)]|exp(m)dr 


and 


?(¥)  =  [r^p+1^2(r)exp{-  ir2)exp(rn)]| , 


RESULT  3.  (Berliner  [5]).  Let  5(x)  =  VC |x|  )x  and  <5g(x)  =  ¥g(|x|)x. 
Suppose  that  both  ¥  and  are  continuous ,  piece-wise  differentiable 
functions  on  (0,»)  such  that,  for  all  real  n,  the  following  conditions 
hold:  (i)  £(¥)  <  (ii)  )  <  or,  (iii)  j£(¥)|  <  and  (iv) 

Ic(V0) I  <  ».  If 

(2.2)  ¥(r){2r¥'(r>  +  (p+l-r2)¥(r)  +  2}  > 

¥0(r){2r¥’(r)  +  (p+l-r2)¥Q(r)  +  2} 

for  all  r  >  0,  and  with  strict  inequality  on  a  set  of  positive  Lebesgue 
measure,  then  6  dominates  6g. 

2-  Main  Results.  Theorems  1  and  2  below  present  classes  of  procedures 
which  dominate  5  .  These  theorems  are  direct  applications  of  Result  3. 


4 


THEOREM  1.  Assure  p  >  6.  Let  6(x)  =  yQxpx  where  y(r)  =  (^-gCr))-1 
Suppose  that 

i)  g(r)  is  continuous,  piece-wise  different iable ,  and  non-decreasing 

ii)  g(r)  is_ not  identically  zero, 

2  2 

and  iii)  0  £  g(r)  £  min{2(p-5);  h(r  ,p)},  where  h(r  ,p)  is  given  by 

h(r2  p)  -  r2^2+3^P~3>  ~  (tr2+ll-p)2  +  32(p-5))1/2] 

2(r2+p-3) 

Then  5  dominates  6  . 

-  -  m 

Proof:  First,  note  that  6  clearly  satisfies  the  requirements  of 
Result  3  (when  p  £  3).  It  is  also  easy  to  verify  that  6  satisfies  these 
requirements . 

Step  1.  To  apply  Result  3,  let  ^(r)  =  r-2  and  ¥(r)  =  (r2-g(r))_1. 
Computing  (2.2)  in  this  case  yields 

(3.1)  (r2-g(r))-1{-2r(2r-g' (r) )(r2-g(r))“2 

+  (p+l-r2)(r2-g(r) )-^  +  2}  >_ 
r  2{2r(-  2r  ^)  +  (p+l~r2)r~2  +2}. 

Simplification  of  (3.1)  yields 

(3-2)  L '-g(r))-3{2r(g'  (r)-2r) 

+  [(p+l-r2)  +  2(r2-g(r))](r2-gCr))}  > 
r'2C(p-3)r"2  +  1], 

It  is  easy  to  check  that  g(r)  £  h(r2  ,p)  implies  that  g(r )  <  r2 
2 

for  all  r  >0.  Using  this  fact,  an  algebraic  manipulation  of  (3.2) 
implies  the  equivalent  inequality 
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(3.3) 


2rg' (r )  +  r  4g(r) {[2(p-5)-g(r) ]r4 
+  [g(r)-3(p-3)]g(r)r2  +  (p-3)g2(r)}  >  0. 

Since  g  (r)  ^  0  it  is  sufficient  to  verify  that 

(3.4)  [2(p-5)-g(r)]r4  +  [g(r)-3(p-3)]g(r)r2  +  (p-3)g2(r)  >  0. 

-S^E J.:  1st  QCg)  denote  the  L.H.S.  of  (3.4).  Note  that  Q(g)  can 
be  written  as  (suppressing  the  dependence  of  g  on  r) 

(3.5)  Q(g)  =  (r2+p-3)gz  _  r2(r2+3(p-3) )g  +  2(p-5)r\ 

Clearly  Q(g)  is  an  upward  opening  quadratic  function  of  g.  Consider 
the  smallest  root,  say  g,  of  Q(g).  The  quadratic  formula  implies  that 
b  -  Mr  ,p).  Next,  inspection  of  h(r2,p)  implies  that  h(r2,p)  >  0 
when  r2  >  0  (and,  of  course,  p  >  6).  Hence,  Q(g)  >  0  for  all  g  <  g. 
Finally,  the  dominance  assertion  then  follows  directly, J| 

The  remainder  of  this  section  is  devoted  to  the  development  of  a 
readily  applicable  version  of  Theorem  1.  The  notivation  of  this  discussion 
is  twofold.  First,  the  function  h(rZ,p)  in  Condition  iii)  of  Theorem  1 
is  rather  cumbersome.  Second,  the  specific  g  functions  to  be  discussed 
in  Section  4  are  easily  bounded  by  linear  functions  of  r2. 

THEOREM  2.  Assure  p  >  6.  Let  1  be  defined  as  in  Theorem  1. 

Suppose  that  the  corresponding  g  function  satisfies  Conditions  i)  and 
ii)  of  Theorem  1.  If 

0  £  g(r)  <_  Cp-5)min{l;r2/y*} 

^ere  y*  >  0  satisfies  h(y*,p)  =  p-5,  then  6  dominates  6  . 

— -  - - - _  m 


The  proof  is  based  on  the  following  lemma. 


LEM'IA  1.  For  p  ^  6  and  y  >_  0,  My,p)  is  an  increasing ,  concave 


function  of  y. 

The  proof  of  Lemma  1  is  a  straightforward  differentiation  argument 
and  is  omitted. 


Proof  of  Theorem  2 :  By  Theorem  1  it  is  sufficient  to  verify  that 
2 

g(r)  <_  mini (p-S) ;  h(r  ,p)}. 

2.2 

First,  by  Leirira  1,  h(r  ,p)  p-5  iff  r  y*.  Now  consider  a 

2  2  , 
graph  of  h(r  ,p)  as  a  function  of  r  .  The  formula  for  the  line  connecting 

2  .  2 
the  origin  and  the  point  at  which  h(r  ,p)  =  p-5  is  [(p-5)/y*]r  . 

2  2  2 

Therefore,  by  concavity,  [(p-5)/y*]r  <_  Mr  ,p)  for  all  r  <  y*.  j| 

We  close  this  section  with  a  few  remarks.  First,  the  limit  as 

?  2 

r  -*■  00  of  Mr  ,p)  is  3Cp-3)/2.  Lemma  1  implies  that  if  p  >  11,  Condition 

2 

iii)  of  Theorem  1  simply  reduces  to  0  <  g(r)  £  Mr  ,p).  (Since  3(p-3)/2  <  2(p-5) 
when  p  >  11 . ) 

Second,  Theorem  2  can  be  generalized  in  the  following  sense.  Theorem  2 
is  an  application  of  Theorem  1  where  the  implicit  upper  bound  on  g  is 
lowered  from  2 (p-5)  to  p-5.  The  same  analysis  could  be  performed  for 
other  upper  bounds  (moderated  by  the  remark  immediately  above)  if  desired. 
However,  Theorem  2  does  include  the  important  asymptotically  optimal 
case. 

Finally,  the  computation  of  y*  is  required.  For  convenience, 
a  partial  list  of  the  values  of  y*  (computed  numerically)  is  given  in 
Table  1. 
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Table  1.  Selected  Values  of  y* 


p 

y* 

P 

y* 

6 

7.605551 

15 

20.000000 

7 

9.123106 

16 

21.280110 

8 

10.582576 

17 

22.549834 

9 

12.000000 

18 

23.810250 

10 

13.385165 

19 

25.062258 

11 

14.744563 

20 

26.306624 

12 

16.082763 

25 

32.433981 

13 

17.403124 

30 

38.440307 

14 

18.703204 

40 

50.206556 

4.  Proposed  Control  Rules.  First,  the  following  limitation  in 

the  application  of  Result  3  is  noted.  Substitution  of  =  ¥  in  (2.2) 

forces  the  R.H.S.  of  the  inequality  to  be  (positive)  unbounded  as  r  +  0 

(see  (3.2)).  Meanwhile,  the  L.H.S.  of  (2.2)  remains  bounded  as  r  +  0 

for  any  admissible  (or  nearly  admissible,  in  the  sense  of  Result  1; 

i),  ii))  rule.  Hence,  the  differential  inequality  (2.2)  cannot  yield 

admissible  alternatives  to  6  .  In  the  case  of  Theorems  1  and  2,  this 

m 

fact  is  reflected  in  the  implicit  restriction  that  the  ¥  functions 

satisfying  the  conditions  of  these  theorems  are  bounded  from  below  by 

_2 

K | x | —  ,  for  some  constant  K,  as  |x|  -*  0.  However,  the  combination  of 

Result  2  and  these  theorems  leads  to  reasonable  alternatives  to  6  for 

m 

p  >_  6. 

PROPOSITION  1.  Assume  p  _>  6.  bat  6(x)  =  ¥(|x|)x.  Suppose  6 
satisfies  the  assumptions  of  Theorem  1  (or  2_) .  Then  5^,  given  by 

«T(X)  =  TjWlxl  ))x 

dominates  5  • 

-  m 
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Obvious,  j] 


Proof : 

REMARK:  Unfortunately,  the  rules  6  are  also  inadmissible  t  :ey 

violate  the  smoothness  requirement  of  Result  1. 

The  final  point  of  this  discussion  is  the  suggestion  of  functions 
g(|x|)  for  actual  use  in  the  application  of  Theorem  2  and  Proposition  1. 

The  functions  described  below  arise  naturally  as  parts  of  a  certain  class 
of  generalized  Bayes  rules.  The  reader  is  referred  to  Berger  [3]  and 
Berliner  [5]  for  discussions.  Only  the  required  facts  are  given  here. 

Let  n  1/2.  For  v  >  0  define  the  function  r^(v)  by 

1  , 
v/Anexp(-  ^vX)dA 

r  (v)  =  ~ -  . 

A^^expt-iv/OdA 

0 

The  following  facts  concerning  rn  are  needed  here: 

LEMMA  2 .  (Berger  [3].)  If  n>  1/2,  then 

i)  0  <  r  (v)  <  2n. 
n 

ii)  r^(v)  >  0. 
n  — 

iii)  lim  r  (v)  =  2n. 
n 

iv)  lim  r  (v)/v  =  n/(n+l). 

_  n 

\f-KD 

v)  r  (v)/v  <  n/(n+l). 
n  — 

o 

Mow,  let  v  =  a|x|  for  some  constant  a  >  0.  Next,  for  p  >_  6,  define 

r*(v)  by  r*(v)  =  rn(v)  for  n  =  (p-5)/2.  Then,  clearly,  by  Lennva  2, 

2  2  9  2  —1 

r*(a|xj  )  ■+  p-5  as  |x|  -*■  =>.  Hence,  the  rule  5(x)  =  ( |x|‘  -r*(a|xj  ))  x  is 

asymptotically  optimal. 


Also,  by  Lenma  2,  note  that 

r*(a|x|2)  <_  [(p-5)/(p-3)]a|x| 

Then,  to  apply  Theorem  2,  we  require  that 

a  <  (p-3)/y*. 

PROPOSITION  2.  Assume  p  >_  6.  Let  6*  be  given  by 
6*(x)  =  T^([|x| 2-r*(a|x|2)]~1)x. 

Then  for  any  constant  a  such  that  0  <  a  <_  (p-3)/y!'!,  6*  dominates  6^. 

Proof :  The  proof  is  a  direct  application  of  the  above  arguments , 

Lemma  2,  and  Proposition  1. 1| 

5 .  Comments .  i)  The  rules  6*  proposed  above  display  desirable  properties 

a)  They  are  relatively  easy  to  confute. 

b)  Their  behavior  for  |x[  large  is  similar  to  that  of  the  generalized 
Bayes,  admissible  rules  discussed  in  Berliner  [S3. 

c)  They  are  asymptotically  optimal. 

However,  they  are  not  admissible  since  the  smoothness  requirement  of 
Result  1  is  violated. 

ii)  A  common  criterion  for  choosing  among  decision  rules  is 
minimaxi ty .  We  simply  note  here  that  6m  is  minimax  when  p  >_  3.  Hence, 

6*  is  also  minimax.  See  Berliner  [5]  for  proofs  and  discussion. 

iii)  Another  natural  control  procedure  often  considered  is  the 
uniform  measure,  generalized  Bayes  rule  5^  (x)  =  (lt|x|  )  "'"x.  This  rule 
is  admissible  for  p  <_  4,  but  inadmissible  for  p  >_  5.  Several  authors 
have  shown  that  6  is  dominated  by  <$m  when  p  ^  5 .  Hence,  the  rules  6* 
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propose!  cLbovG  a lso  domi na t e  6^  when  p  ^  6*  For*  further  discussion, 
references  for  the  above  results,  and  another  class  of  rules  which 
dominate  6^,  see  Berliner  [5]. 
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